Structured language modeling

Authors

  • Ciprian Chelba
  • Frederick Jelinek
Abstract

This paper presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an original probabilistic parameterization of a shift-reduce parser. A maximum likelihood re-estimation procedure belonging to the class of expectation-maximization algorithms is employed for training the model. Experiments on the Wall Street Journal and Switchboard corpora show improvement in both perplexity and word error rate (measured by word lattice rescoring) over the standard 3-gram language model. © 2000 Academic Press

Similar resources

Advancing the Systems Analysis and Design Curriculum

Computer Information System and related programs are expected to produce students who possess a broad and contemporary understanding of analysis and design for information systems. An empirical analysis of the state of practice in systems analysis and design education revealed an emphasis on structured design in the majority of schools. The opportunity to transition to object-oriented analysis ...

Structured queries, language modeling, and relevance modeling in cross-language information retrieval

Two probabilistic approaches to cross-lingual retrieval are in wide use today, those based on probabilistic models of relevance, as exemplified by INQUERY, and those based on language modeling. INQUERY, as a query net model, allows the easy incorporation of query operators, including a synonym operator, which has proven to be extremely useful in cross-language information retrieval (CLIR), in a...

Structured Modeling Language for Automated Modeling in Causal Networks

The paper presents a structured modeling language (SML) and a relational database framework for specification and automated generation of causal models. The framework describes a relational database scheme for encoding a library of causal network templates modeling the basic components in a modeling domain. SML provides a formal language for specifying models as structured components th...

Extending SOFL to Support Both Top-Down and Bottom-Up Approaches

This paper presents an integrated approach to support both top-down and bottom-up design of software systems by combining UML (Unified Modeling Language) and the Formal Engineering Method SOFL (Structured Object-oriented Formal Language). We demonstrate by examples that the top-down principle used in conventional Structured Design can be effectively utilized to carry out Object-Oriented design th...

Semantic structured language models

In this study, we propose two novel semantic language modeling techniques for spoken dialog systems. These methods are called semantic concept based language modeling and semantic structured language modeling. In the concept based language modeling, we propose to use long span semantic units to model meaning sequences in spoken utterances. In the latter technique, we use statistical semantic pa...

Towards Structured Business Process Modeling Languages

A Process-Aware Information System (PAIS) is a software system driven by explicit Business Process (BP) models. A basic PAIS provides at least an execution engine, a Business Process Modeling Language (BPML) and a graphical editor. The editor is mainly used to design new BP models, maintain existing ones and check their correctness. BPMLs typically embrace an unstructured control-flow paradigm ...


Journal:
  • Computer Speech & Language

Volume 14

Publication date: 2000